Search CORE

9 research outputs found

Random forests with random projections of the output space for high dimensional multi-label classification

Author: D. Achlioptas
D. Kocev
E.J. Candes
F. Pedregosa
G. Madjarov
G. Tsoumakas
G. Tsoumakas
J. Read
J.L. Faulon
L. Breiman
P. Geurts
W.B. Johnson
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

We adapt the idea of random projections applied to the output space, so as to enhance tree-based ensemble methods in the context of multi-label classification. We show how learning time complexity can be reduced without affecting computational complexity and accuracy of predictions. We also show that random output space projections may be used in order to reach different bias-variance tradeoffs, over a broad panel of benchmark problems, and that this may lead to improved accuracy while reducing significantly the computational burden of the learning stage

arXiv.org e-Print Archive

Crossref

Open Repository and Bibliography - Liège

PMG: Multi-core metabolite identification

Author: Boer F. de
Faulon J.L.
Hankemeier T.
Jaghoori M.M.
Jongmans S.S.T.Q.
Peironcely J.E.
Reijmers T.H.
Publication venue: 'Elsevier BV'
Publication date: 25/12/2013
Field of study

Distributed computing has been considered for decades as a promising way of speeding up software execution, resulting in a valuable collection of safe and efficient concurrent algorithms. With the pervasion of multi-core processors, parallelization has moved to the center of attention with new challenges, especially regarding scalability to tens or even hundreds of parallel cores. In this paper, we present a scalable multi-core tool for the metabolomics community. This tool addresses the problem of metabolite identification which is currently a bottleneck in metabolomics pipeline.Analytical BioScience

Leiden University Scholary Publications

Novel techniques for automorphism group computation

Author: B.D. McKay
G. Tener
H. Katebi
H. Katebi
J.-L. Faulon
J.L. López-Presa
T. Czajka
T. Junttila
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Graph automorphism (GA) is a classical problem, in which the objective is to compute the automorphism group of an input graph. In this work we propose four novel techniques to speed up algorithms that solve the GA problem by exploring a search tree. They increase the performance of the algorithm by allowing to reduce the depth of the search tree, and by effectively pruning it. We formally prove that a GA algorithm that uses these techniques correctly computes the automorphism group of the input graph. We also describe how the techniques have been incorporated into the GA algorithm conauto, as conauto-2.03, with at most an additive polynomial increase in its asymptotic time complexity. We have experimentally evaluated the impact of each of the above techniques with several graph families. We have observed that each of the techniques by itself significantly reduces the number of processed nodes of the search tree in some subset of graphs, which justifies the use of each of them. Then, when they are applied together, their effect is combined, leading to reductions in the number of processed nodes in most graphs. This is also reflected in a reduction of the running time, which is substantial in some graph families

Crossref

Archivo Digital UPM

Visual Network Analysis of Dynamic Metabolic Pathways

Author: A. Ullrich
A. Ullrich
B.O. Palsson
C. Klukas
D. Weininger
E.R. Gansner
G. Battista Di
G. Caetano-Anollés
H. Wiener
J. Branke
J.C. Roberts
J.L. Faulon
K. Misue
K. Sugiyama
M. Albrecht
M. Rohrschneider
N.H. Horowitz
R. Rao
R. Steuer
R.A. Jensen
S. Diehl
U. Brandes
Y. Frishman
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Abstract. We extend our previous work on the exploration of static metabolic networks to evolving, and therefore dynamic, pathways. We apply our visualization software to data from a simulation of early metabolism. Thereby, we show that our technique allows us to test and argue for or against different scenarios for the evolution of metabolic pathways. This supports a profound and efﬁcient analysis of the structure and properties of the generated metabolic networks and its underlying components, while giving the user a vivid impression of the dynamics of the system. The analysis process is inspired by Ben Shneiderman’s mantra of information visualization. For the overview, user-deﬁned diagrams give insight into topological changes of the graph as well as changes in the attribute set associated with the participating enzymes, substances and reactions. This way, “interesting features” in time as well as in space can be recognized. A linked view implementation enables the navigation into more detailed layers of perspective for in-depth analysis of individual network conﬁguration

CiteSeerX

Crossref

Repository: Freie Universität Berlin (FU), Math Department (fu_mi_publications)

Application of Conformal Prediction in QSAR

Author: G. Shafer
H. Dragos
H. Papadopoulos
J. Huuskonen
J. Jaworska
J.H. Drie van
J.L. Faulon
J.L. Faulon
J.L. Hintze
T.A. Halgren
T.I. Netzeva
Z. Bosnić
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Part 4: First Conformal Prediction and Its Applications Workshop (COPA 2012)International audienceQSAR modeling is a method for predicting properties, e.g. the solubility or toxicity, of chemical compounds using statistical learning techniques. QSAR is in widespread use within the pharmaceutical industry to prioritize compounds for experimental testing or to alert for potential toxicity. However, predictions from a QSAR model are difficult to assess if their prediction intervals are unknown. In this paper we introduce conformal prediction into the QSAR field to address this issue. We apply support vector machine regression in combination with two nonconformity measures to five datasets of different sizes to demonstrate the usefulness of conformal prediction in QSAR modeling. One of the nonconformity measures provides prediction intervals with almost the same width as the size of the QSAR models’ prediction errors, showing that the prediction intervals obtained by conformal prediction are efficient and useful

Crossref

Thermodynamic Properties Of Asphaltenes: A Predictive Approach Based On Computer Assisted Structure Elucidation And Atomistic Simulations

Author: J.L. Faulon
M.S. Diallo
T. Cagin
W. A. Goddard Iii
Publication venue
Publication date: 01/01/2000
Field of study

INTRODUCTION Crude oil is a complex mixture of hydrocarbons and heteroatomic organic compounds of varying molecular weight and polarity [1]. A common practice in the petroleum industry is to separate crude oil into four chemically distinct fractions: saturates, aromatics, asphaltenes and resins [1--4]. Asphaltenes are operationally defined as the non-volatile and polar fraction of petroleum that is insoluble in n-alkanes (i.e., pentane). Conversely, resins are defined as the non-volatile and polar fraction of crude oil that is soluble in n-alkanes (i.e., pentane) and aromatic solvents (i.e., toluene) and insoluble in ethyl acetate. A commonly accepted view in petroleum chemistry is that asphaltenes form micelles which are stabilized by adsorbed resins kept in solution by aromatics [5,6]. Two key parameters that control the stability of asphaltene micelles in a crude oil are the ratio of aromatics to saturates and that of resins to asphaltenes.

CiteSeerX

The University of Manchester - Institutional Repository

Reverse engineering chemical structures from molecular descriptors: how many solutions?

Author: A. Bender
A. Bender
A.T. Balaban
C.J. Churchwell
D. Bonchev
D.A. Filimonov
D.M. Hawkins
H. Wiener
I.I. Baskin
J.-L. Faulon
J.-L. Faulon
J.-L. Faulon
J.-L. Faulon
J.-L. Faulon
J.L. Faulon
J.W. Godden
Jean-Loup Faulon
L.B. Kier
L.B. Kier
L.B. Kier
L.H. Hall
M. Randic
M.I. Skvortsova
M.I. Skvortsova
N.S. Zefirov
R.P. Sheridan
Shawn Martin
V. Kvasnicka
V. Venkatasubramanian
V.V. Poroikov
V.V. Zernov
W. Michael Brown
W. Tong
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Molecular modeling of the structure and properties of coal organic matter

Author: A. M. Gyul’maliev
A. Marzec
A. Marzec
A.I. Kamneva
A.M. Gyul’maliev
A.M. Gyul’maliev
A.M. Gyul’maliev
C.L. Spiro
C.L. Spiro
D.W. Krevelen Van
I.V. Kalechits
J.H. Shinn
J.L. Faulon
J.L. Faulon
K.L. Smith
M.I. Shchadov
M.R. Narkiewicz
S. G. Gagarin
S.G. Gagarin
S.G. Gagarin
S.G. Gagarin
S.G. Gagarin
S.G. Gagarin
S.G. Gagarin
S.G. Gagarin
S.M. Grigor’ev
T. Dong
T. Kabe
V.B. Artem’ev
W. Fuchs
Publication venue: 'Allerton Press'
Publication date
Field of study

Crossref

Virtual porous carbons: what they are and what they can be used for

Author: A. Buts
Acharya M.
Allen M.P.
Avnir D.
Bandosz T.J.
Bhatia S.K.
Biggs M.
Biggs M.
Biggs M.J.
Biggs M.J.
Biggs M.J.
Bojan M.J.
Brennan J.K.
Brown D.
Carrott P.J.M.
Chen X.S.
Christenson H.K.
Debye P.
Emmerich F.G.
Emmett P.H.
Everett D.H.
Faulon J.L.
Floess J.K.
Ford D.M.
Gavalda S.
Gefen Y.
Gelb L.D.
Harris P.J.F.
Jin W.
Jorge M.
Kaneko K.
Kaneko K.
Kruk M.
Kumar A.
Lastoskie C.
López-Ramon M.V.
M. J. Biggs
Mandelbrot B.B.
Marsh H.
Mezei M.
Muralidhar R.
Neimark A.V.
Nicholson D.
Nyden M.R.
Oberlin A.
Peterson T.
Petkov V.
Pfeifer P.
Pikunic J.
Pikunic J.
Pikunic J.
Proffen T.
Rodriguez J.
Rouquerol J.
Rouzaud J.N.
Rychlicki G.
Sahimi M.
Seaton N.A.
Segarra E.I.
Shevade A.V.
Shim H.S.
Smit B.
Steele W.A.
Stoeckli F.
Sweatman M.B.
Thomson K.T.
van Megen W.
Vishnyakov A.
Walton J.P.R.B.
Walton J.P.R.B.
Zetterström P.
Zhu Z.
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2006
Field of study

We use the term “virtual porous carbon” (VPC) to describe computer-based molecular models of nanoporous carbons that go beyond the ubiquitous slit pore model and seek to engage with the geometric, topological and chemical heterogeneity that characterises almost every form of nanoporous carbon. A small number of these models have been developed and used since the early 1990s. These models and their use are reviewed. Included are three more detailed examples of the use of our VPC model. The first is concerned with the study of solid-like adsorbate in nanoporous carbons, the second with the absolute assessment of multi-isotherm based methods for determining the fractal dimension, and the final one is concerned with the fundamental study of diffusion in nanoporous carbons.M. J. Biggs and A. But

Heriot Watt Pure

Crossref

Adelaide Research & Scholarship